Fine-scale estimation of location of birth from genome-wide single-nucleotide polymorphism data.

نویسندگان

  • Clive J Hoggart
  • Paul F O'Reilly
  • Marika Kaakinen
  • Weihua Zhang
  • John C Chambers
  • Jaspal S Kooner
  • Lachlan J M Coin
  • Marjo-Riitta Jarvelin
چکیده

Systematic nonrandom mating in populations results in genetic stratification and is predominantly caused by geographic separation, providing the opportunity to infer individuals' birthplace from genetic data. Such inference has been demonstrated for individuals' country of birth, but here we use data from the Northern Finland Birth Cohort 1966 (NFBC1966) to investigate the characteristics of genetic structure within a population and subsequently develop a method for inferring location to a finer scale. Principal component analysis (PCA) shows that while the first PCs are particularly informative for location, there is also location information in the higher-order PCs, but it cannot be captured by a linear model. We introduce a new method, pcLOCATE, which is able to exploit this information to improve the accuracy of location inference. pcLOCATE uses individuals' PC values to estimate the probability of birth in each town and then averages over all towns to give an estimated longitude and latitude of birth using a fully Bayesian model. We apply pcLOCATE to the NFBC1966 data to estimate parental birthplace, testing with successively more PCs and finding the model with the top 23 PCs most accurate, with a median distance of 23 km between the estimated and the true location. pcLOCATE predicts the most recent residence of NFBC1966 individuals to a median distance of 47 km. We also apply pcLOCATE to Indian individuals from the London Life Sciences Prospective Population Study (LOLIPOP) data, and find that birthplace is predicated to a median distance of 54 km from the true location. A method with such accuracy is potentially valuable in population genetics and forensics.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Bayesian Estimation of the Inbreeding Coefficient for Single Nucleotide Polymorphism Using Complex Survey Data

Title of Document: BAYESIAN ESTIMATION OF THE INBREEDING COEFFICIENT FOR SINGLE NUCLEOTIDE POLYMORPHISM USING COMPLEX SURVEY DATA Zhenyi Xue, Doctor of Philosophy, 2015 Directed By: Professor Partha Lahiri Associate Professor Yan Li Joint Program in Survey Methodology In genome-wide association studies (GWAS), single nucleotide polymorphism (SNP) is often used as a genetic marker to study gene-...

متن کامل

DNA Polymorphisms at Candidate Gene Loci and Their Relation with Milk Production Traits in Murrah Buffalo (Bubalus bubalis)

DNA polymorphism within diacylglycerol transferase 2 (DGAT2) / monoacyl glycerol transferases 2 (MOGAT2), leptin and butyrophilin genes were analysed using PCR-SSCP in Murrah buffalo. The single strand conformation polymorphism (SSCP) analysis of amplified gene fragment in exon 5 of MOGAT2, exon 3 of leptin and intron 1 of butyrophilin gene revealed different patterns. A, B and C showed the fol...

متن کامل

Run of Homozygosity a Procedure to Detecting Inbreeding in Farm Animals

Inbreeding depression is a harmful phenomenon in livestock which is outcome of inbreeding. Inbreeding is consequence mating between two individuals who are more related to each other than average relatedness in population, which results in reducing in fitness of progenies and genetic variability in populations. Development of high-density genome-wide single nucleotide polymorphism (SNP) array f...

متن کامل

Genome-wide survey of single-nucleotide polymorphisms reveals fine-scale population structure and signs of selection in the threatened Caribbean elkhorn coral, Acropora palmata

The advent of next-generation sequencing tools has made it possible to conduct fine-scale surveys of population differentiation and genome-wide scans for signatures of selection in non-model organisms. Such surveys are of particular importance in sharply declining coral species, since knowledge of population boundaries and signs of local adaptation can inform restoration and conservation effort...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Genetics

دوره 190 2  شماره 

صفحات  -

تاریخ انتشار 2012